A Clustering Analysis of the Morphology of the 130 GeV Gamma-Ray Feature 
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Recent observations indicating tiie existence of a monochromatic 7-ray line with energy ~130 GeV 
in the Fermi-LAT data have attracted great interest due to the possibility that the line feature 
stems from the annihilation of dark matter particles. Many studies examining the robustness of the 
putative line-signal have concentrated on its spectral attributes. Here, we study the morphological 
features of the 7-ray line photons, which can be used to differentiate a putative dark matter signal 
from astrophysical backgrounds or instrumental artifacts. Photons stemming from dark matter 
annihilation will produce events tracing a specific morphology, with a statistical clustering that 
can be calculated based on models of the dark matter density profile in the inner Galaxy. We 
apply the DBSCAN clustering algorithm to Fermi 7-ray data, and show that we can rule out the 
possibility that 1 (2, 4) or fewer point-like sources produce the observed morphology for the line 
photons at a 99% (95%, 90%) confidence level. Our study strongly disfavors the main astrophysical 
background envisioned to produce a line feature at energies above 100 GeV: cold pulsar winds. It is 
highly unlikely that 4 or more such objects have exactly the same monochromatic cosmic-ray energy 
needed to produce a 7-ray line, to within instrumental energy resolution. Furthermore, we show 
that the larger photon statistics expected with Air Cherenkov Telescopes such as H.E.S.S.-II will 
allow for extraordinarily stringent morphological tests of the origin of the "line photons" . 

PACS numbers: 98.70.Rz, 95.55.Ka, 95.35. -l-d, 97.60. Gb 



I. INTRODUCTION 

The launch of the Fermi Large Area Telescope (LAT) 
in 2008 has allowed for a greatly expanded view of the 
7-ray sky, including a significantly enhanced energy and 
angular resolution, compared to previous missions 1]. 
These characteristics have allowed, in particular, for a 
thorough investigation of the extremely dense popula- 
tion of high-energy 7-ray sources in the Galactic center 
(GC) region [2]. The GC region is known to host such 
diverse 7-ray sources as supernova remnants [3], highly 
ionized gas [1], dense molecular clouds [S], massive 0/B 
stars [6] , both young and recycled pulsar populations [7] , 
as well as being the densest region of dark matter in the 
Galaxy [5|. Notably, no other location in the sky is ex- 
pected to provide a signal from dark matter annihilation 
which is as bright as the GC (for a recent general review 
of gamma-ray searches for signals from dark matter an- 
nihilation see Ref. [H"). While this makes the GC region 
an extremely interesting location for a multitude of sci- 
entific studies, it also means that additional information, 
such as characteristic spectra for each source class, must 
be carefully considered in order to separate the desired 
signal from the bright background. 

Recently, Bringmann et al. P,0 and Weniger [TT] found 
indications for a unique spectral signature in observations 
of the region surrounding the GC, which is consistent 
with dark matter annihilation. Specifically, in sky re- 
gions optimized for large signal-to-noise ratios for various 
dark matter density profiles, they observed an excess of 
photons with an energy spectrum resembling a 130 GeV 



7-ray line, smeared by the finite energy resolution of the 
Fermi-LAT telescope. The significance of this excess was 
found to be 3.2a globally [TT] - enough to make the fea- 
ture interesting on statistical grounds. The feature is 
strongest when using regions of interest that have been 
optimized for dark matter density distributions follow- 
ing: (1) a Navarro-Frenk- White (NFW) profile flT, (2) 
an Einasto profile [13], and (3) a generalized NFW pro- 
file, with a radial slope governing the dark matter density 
profile (r~") set to a = 1.15, similar to what could result 
from adiabatic contraction [lU [15] . The reported feature 
is much weaker for profiles where the dark matter density 
is cored near the GC. 

A monochromatic 7-ray line has been long consid- 
ered the "Holy Grail" for dark matter indirect detec- 
tion, given the difficulty of producing a high-energy 
monochromatic signal with ordinary astrophysical pro- 
cesses. Thus, this observation prompted a number of 
follow-ups. Profumo and Linden ,16^ noted that the ob- 
servation of a monochromatic 7-ray signal could be qual- 
itatively mimicked by an additional power-law compo- 
nent which breaks strongly at an energy of 130 GeV, and 
posited the Fermi bubbles [17] as a possible source for this 
excess (although rather strong breaks are required to fit 
the data, see Ref. [S])- Most notably, Su and Finkbeiner 
[18) localized the emission to within approximately 5° 
of the GC, finding a 6.5cr (5.0(7 after including a trials 
factor) preference for a line signal following an off-center 
Einasto profile compared to the assumption of a simple 
power-law background (see also il9 ). This finding dis- 
putes the implication of the Fermi bubbles as a source 
for the excess, as the latter are observed to extend over a 



much larger emission region. The ofF-centered nature of 
the dark matter profile is at the moment only marginally 
statistically significant 1201 HI]- However, if confirmed, 
an off-center peak would pose a challenge to traditional 
models of the Galactic dark matter density which assume 
the largest dark matter density to fall on top of the peak 
of the baryonic mass density^. However, a recent anal- 
ysis by Kuhlen et al. [22] found that the peak in the 
dark matter density may, in fact, be displaced by hun- 
dreds of parsecs from the dynamical center of the Galaxy 
- although this scenario might be incompatible with the 
assumption of a cusped profile. 

More data are needed to clarify whether the 130 GeV 
feature in the data persists with larger statistics or if it is 
only a statistical fluke. Since the observation of a 7-ray 
line is extremely sensitive to inaccuracies in the energy- 
reconstruction of 7-rays observed by Fermi-LAT, a great 
deal of interest has also focused on searching for possible 
instrumental abnormalities affecting the photons belong- 
ing to the 7-ray line. The most notable characteristic 
of any such instrumental effect would be the observation 
of line activity either across the entire sky, or across a 
certain region of instrumental phase space. Interestingly, 
some early results found an excess of 130 GeV events in 
observations of the Earth- limb [TSl ^^ HI] . This is trou- 
bling, as the vast majority of limb photons are known to 
result from the di-photon n'^ decay spectrum created as 
cosmic-ray protons interact with the upper layers of the 
Earth's atmosphere. There is no conceivable model in 
which dark matter annihilation could create a 7-ray line 
in this region [T5]. However, this line activity was not 
detected along the Galactic plane, which provides signif- 
icantly more photons than the GC, and inhabits similar 
regions of instrumental phase space. A comprehensive 
study by Finkbeiner et al. '23] did not find any significant 
evidence for systematic features in the energy reconstruc- 
tion of the Fermi-LAT, which would be able to artificially 
produce a 7-ray line (see however Refs. [25H27] ). 

Follow up analyses by the Fermi-LAT team are cur- 
rently ongoing, but have revealed two noteworthy re- 
sults. At the time of the initial discovery of the 7-ray 
line, efforts were already ongoing to improve the energy 
normalization of the Fermi-LAT data, accounting for a 
decrease in the calculated energy of 7-rays over time due 
to radiation damage to the calorimeter. This effect lin- 
early increased the reconstructed energy of high energy 
photons, moving the line signal from 130 GeV up to ap- 
proximately 135 GeV. However, this reprocessing did not 
greatly affect any other signature of the posited line anal- 
ysis. We note that throughout the rest of this paper, we 
refer to the "130 GeV line", as we are using a version 
of the Fermi-LAT data which has not been reprocessed. 
However, all results shown here are very nearly applica- 



^ Which we assume, here, to be identical to the position of Sgr A* 
and call the GC throughout this paper. 



ble to an analysis of the 135 GeV line observed in the 
reprocessed data. 

Additionally, the Fermi-LAT analysis did uncover one 
troubling aspect of the posited 7-ray line. Employing a 
parameter CTBBestEnergyProb (which is not publicly 
available), they investigated the confidence they had in 
the energy reconstruction of each photon belonging to 
the 7-ray line. In the case where a true 7-ray line feature 
is present in the data, this should increase the statis- 
tical significance of the observation, as the proceedure 
adds additional statistical weight to the line photons 
which are most likely to have a correctly measured en- 
ergy. However, when this analysis was applied to the ob- 
served photon data, the statistical significance of the line 
feature was found to decrease moderately. This signals 
that the line feature has photons with a somewhat poorer 
energy resolution than would generally be expected [28]. 
However, further inquiry of these systematic issues is re- 
quired, as none of the systematics can clearly account for 
the entire statistical strength of the line feature. 

If interpreted as a signal of particle dark matter an- 
nihilation or decay, the large observed intensity of the 
130 GeV 7-ray line (along with strong constraints on 
the total continuum emission from additional hadronic 
states [29]) has proved a difficult, though by no means 
intractable, particle physics problem. Numerous mod- 
els have already been posited to "brighten" the 7-ray 
line |30H59| . Summarizing the myriad particle physics de- 
tails of these models lies beyond the scope of the present 
paper. 

Although often characterized as a "smoking gun" sig- 
nature for the annihilation or decay of particle dark 
matter, tentative observations of the 130 GeV line have 
spurred the question of whether any traditional astro- 
physical mechanism might mimic a line in the relevant 
energy range. Aharonian et al. 46^ argued that the only 
plausible mechanism for the creation of an astrophysical 
7-ray line is through inverse Compton scattering of am- 
bient photons by a jet of nearly monoenergetic electrons 
and/or positrons, occurring in the deep Klein-Nishina 
regime. If the latter kinematic regime holds, the photon 
acquires nearly the entire energy of the incoming lepton, 
allowing for a nearly monoenergetic lepton spectrum to 
efficiently transfer into a sharply peaked 7-ray feature. 

One possible class of astrophysical objects that 
possesses the potential to host the needed leptonic 
monochromatic "jet" , as well as the ambient photons in 
the needed energy range (here, a few eV) is cold ultra- 
relativistic pulsar (PSR) winds '60]. It is important to 
note that this scenario presents a potential difficulty in 
explaining the observed spread of 130 GeV photons be- 
yond a single point source, as different PSRs would be 
expected to exhibit 7-ray lines at different energies. One 
way to test this one astrophysical background is there- 
fore to study whether the morphology of the observed 
130 GeV photons can be reproduced with a small num- 
ber of point sources or not. This is the key objective of 
the present study. 



Several other approaches have tested the dark matter 
nature of the 130 GeV photons. For example, any dark 
matter interpretation of the 130 GeV line implies addi- 
tional regions of interest for follow up searches, where the 
so-called J-factor (the line-of-sight integral of the dark 
matter density squared, smeared over the instrumental 
point-spread function) is expected to be largest. Most 
importantly, dwarf spheroidal galaxies and galaxy clus- 
ters have been singled out as promising regions to search 
for a dark matter signal. Observations of Milky Way 
dwarfs have not uncovered any evidence of a 130 GeV 
signal [ST]: this is not unexpected, as the estimated an- 
nihilation cross-section to 77 implied from GC observa- 
tions predicts less than one photon to arrive from the 
population of dwarf spheroidal galaxies. Interestingly, 
Hektor et al. ;£2j argued for an observation of a 130 GeV 
line in a population of nearby galaxy clusters. However, 
it should be noted that this signature is only significant 
when very large ROIs of ~8° are considered, which is 
much larger than the expected angular size of the galaxy 
clusters under investigation. 

Using a similar method, Su and Finkbeiner |63| inves- 
tigated the population of unassociated Fermi-LAT point 
sources - that is, point sources detected by the Fermi- 
LAT instrument which have not been identified at other 
wavelengths. They found a statistically significant de- 
tection for a double 130 GeV and 111 GeV line, with 
14 unassociated sources showing evidence of a line pho- 
ton. Furthermore, they found no significant detection of 
7-ray line emission in the control sample of Fermi-LAT 
point sources that have already been associated with var- 
ious astrophysical phenomena. This caused them to con- 
clude that some portion of the unassociated point-source 
sample may contain previously unknown dark matter 
substructures. However, Hooper and Linden |64j ar- 
gue against this conclusion, noting that each unassoci- 
ated source is identified primarily based on its continuum 
emission between energies of 100 MeV-10 GeV, rather 
than based on the detection of a single line photon. The 
intensity and spectrum of this continuum emission can 
then be compared to the signal from dark matter anni- 
hilation to any final state producing a 7-ray continuum, 
and is expected in order to produce the thermal relic 
abundance of dark matter. They find that for at least 12 
of the 14 indicated unassociated point sources, the con- 
tinuum emission is not compatible with any dark matter 
annihilation pathway. Furthermore, they argue that the 
latitude distribution of the identified sources is not con- 
sistent with that expected from any model of dark matter 
subhalo formation. A second analysis by Mirabal ,65] ar- 
gued that while these 14 sources remain unidentified, at 
least 12 of the sources (not identical to those from ^^) 
are spectrally strongly consistent with AGNs. 

In addition to considering the energy signature of the 
130 GeV line, understanding the morphology of the pho- 
tons belonging to the source class producing the observed 
feature will be key to elucidate the physics behind the 
line phenomenon. Notably, the point-spread function 



of front-converting events at energies near 100 GeV ap- 
proaches 0.1° [1], which is significantly smaller than the 
^5° region of interest implicated by [IH], allowing for the 
actual morphology of the line emission to be closely mir- 
rored by Fermi-LAT observations. To first order, obser- 
vations indicating a morphology consistent with widely 
accepted dark matter density profiles would provide ad- 
ditional evidence for a dark matter interpretation (the 
data indeed points to that direction, see e.g. Fig. 3 of 
Ref. [9]), while measurements consistent with either a 
population of point sources, or a significantly flattened 
profile may point to other astrophysical or instrumental 
interpretations. 

In this paper, we examine the morphology of photons 
belonging to the 7-ray line more closely, analyzing with 
a sound statistical approach the distribution of the ar- 
rival direction of photons by employing a clustering al- 
gorithm to pinpoint the correlation between the arrival 
directions of photons putatively belonging to the line fea- 
ture. We then compare these results against simulated 
models where the line is produced by dark matter anni- 
hilation or by an astrophysical process associated with a 
few point sources (for example a handful of PSRs). The 
key result of our study is that current data disfavor a sce- 
nario where the line photons stem from 4 or fewer point 
sources. Given how unlikely it is that 4 or more pulsars 
produce a gamma-ray line at exactly the same energy 
(within the LAT energy resolution), our study disfavors 
the PSR scenario over a truly diffuse and un-clustered 
origin for the photons. 

In Section [n] we describe the data employed in our ob- 
servations of the 7-ray line feature, the specifics of the al- 
gorithms used to determine the photon morphology, our 
models for both the annihilation of dark matter and emis- 
sion from PSRs, and the diffuse background in the region. 



In Section III we present the results of our study both 
for current Fermi observations, and for projections for 
upcoming observations of the GC with the Atmospheric 
Cherenkov Telescope (ACT) H.E.S.S.-H. Finally, in Sec- 
tion |IV] we discuss the interpretation of the results, and 
present our conclusions. 



II. MODELS 

A. Photon Selection 

In order to analyze the population of photons stem- 
ming from the putative 7-ray line emission, we must make 
a photon selection which isolates the line photons from 
those correlating to background events. We follow here 
the same photon selection employed in Ref. [66', which 
provides the location of observed Fermi-LAT photons in 
three energy bands, 70-110 GeV, 120-140 GeV, and 150- 
300 GeV over a 10° square window centered on the GC. 
Since the Fermi-LAT energy resolution is approximately 
10% at 130 GeV, we assume photons in the 120-140 GeV 
band to encompass the photons related to the 7-ray line 



observation, while photons in the low and high energy 
bands correspond to background events not associated 
with the 7-ray line observation. Below, a power-law fit 
to the sidebands will be used to fix the background rate 
in our 120-140 GeV simulations while the remainder of 
the photon excess will make up the signal. We note that 
there is some evidence for a second line at energies of 
around 111 GeV [IH], however the weak significance of 
that feature makes its impact on the population of 70- 
110 GeV 7-rays negligible. 

In comparing the photon morphology from the line re- 
gion against the "side-band" photons, we assume that the 
background morphology remains approximately invari- 
ant throughout the 70-300 GeV energy range. This as- 
sumption is warranted in light of observations indicating 
that the primary component of diffuse emission through 
this region stems from Tr'^-emission tracing the Galac- 
tic gas [29l |67j. While some unresolved point-sources 
may also be present, it is unlikely that any given source 
contributes multiple photons to the observed high-energy 
7-ray emission, making the spectral features of each in- 
dividual source irrelevant. 

Before proceeding with the description of the cluster- 
ing algorithm wc employ in this analysis, we describe in 
the following sections the simulated data sets we use to 
validate our analysis. Section JIB details the simulated 



line events from both dark matter annihilation scenarios 
with different dark matter density profiles, and scenarios 
with one or more point sources. Section |IIC 



details the 



simulated background events. Finally, sec. II D describes 
the clustering algorithm we employ in the present study. 



B. Dark Matter and Pulsars Models 

In order to establish a quantitative measure for the 
clustering properties of 7-rays due to either dark matter 
or one or more point-sources in the GC region, we pro- 
duce Monte Carlo simulations of the expected positions 
of photons stemming from each model. 

In the case of PSRs, we examine models featuring 
between 1 and 6 point sources to explain the excess 
130 GeV emission. We randomly pick the distribution 
of each point source following a surface density distribu- 
tion p(r) ex r~^'^, as motivated by the observed density 
distribution of 0/B stars in the inner Galaxy [SS] and 
we produce an excess number of photons which are dis- 
tributed randomly (assuming equal brightness) between 
the simulated pulsars. 

In the case of dark matter annihilation, we predict the 
annihilation signal to follow the integral over the line of 
sight of the square of the dark matter density. We choose 
two independent dark matter density profiles motivated 
by models of dark matter structure formation. We first 
examine a generalized Navarro-Frenk- White 12J profile, 
with a density profile 
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-3+a 



(1) 



In our standard analysis we choose a — 1 and r^ — 22 kpc 
fitting the best numerical results from the Aquarius simu- 
lation [13J . In order to evaluate the effect of changing the 
dark matter density profile, we also consider an Einasto 
profile with a density distribution [13] : 



p{r) ex exp 




- 1 



(2) 



assuming, here, that a — 0.17 [IB]. In each case, we 
assume that the annihilation rate is proportional to p^ (r) , 
and then integrate over the line of sight from the solar 
position R0 = 8.3 kpc '69' in order to generate the dark 
matter morphology that would be observed by the Fermi- 
LAT. 

We additionally consider two alternative profiles. 
First, the case of decaying dark matter following the 
NFW profile given in Eq. (fTl), with the decay rate now 
proportional to /o(r). Second, the case of isotropic emis- 
sion, i.e. a uniform surface profile. Since the cluster- 
ing properties of the source are highly dependent on the 
number of observed photons, we calculate for Fermi-LAT 
(H.E.S.S.-II), 10^ (2000) realizations of 48 (5000) photons 
following the distribution assumed in each of these cases 
over a 10° (4°) square window. For H.E.S.S.-II observa- 
tions, we estimate the number of photons from a rela- 
tively short exposure time, on the order of a 6.25 hours, 
using an effective area given for the H.E.S.S.-II telescope 
with the flux in the 130 GeV energy range measured by 
the Fermi-LAT. 

In each case, we must also consider the smearing of 
target photons based on the point-spread function of the 
Fermi-LAT telescope. In order to accomplish this accu- 
rately for observations at the GC, we employ the Fermi 
tools to estimate the point-spread function for photons 
entering both the front and back of the instrument at 
different 0-angles. Specifically, we employ the gtpsf tool 
developed by the Fermi-LAT collaboration in order to 
calculate the effective PSF given the total exposure of 
the GC region from all locations in the Fermi-LAT in- 
strumental phase space (i.e. how much was the GC 
viewed from different spacecraft orientations). For the 
selected observation period, the average (68%, 95%) con- 
tainment radius over the observation is (0.124°, 0.529°) 
for front-converting events (~ 56% of exposure area) and 
(0.258°, 0.907°) for rear converting events (~ 44% of ex- 
posure area). The resulting PSF for each photon (sig- 
nal and background) is randomly chosen based on this 
weighted average of instrument coordinates and the in- 
coming photon is smeared based on the given PSF. In the 
case of Atmospheric Cherenkov Telescope (ACT) simula- 
tions, the PSF depends somewhat sensitively on the angle 



of incidence of the incoming photons. Following Aharo- 
nian et al [70] we approximate the H.E.S.S. point-spread 
function as an energy independent, two-component Gaus- 
sian with the probability density of an event smearing to 
radius 9 given by, 



P{e) = AQ 



exp 



2cri 



Arel exp 



2cr2 



(3) 



Where cti = 0.046, 02 = 0.12, and A^ei = 0.15 and an 
overall normalization A. 



The background model described above is also used 
to estimate the average background count Nb during the 
computation of the cluster significances. Nf, is calculated 
by integrating the background template, normalized to 
the correct background count over the 95% containment 
area of the cluster members (100% containment in cases 
with fewer than 20 cluster members). This allows for a 
statistical measure which traces the local morphology of 
the background and is thus minimally dependent on its 
anisotropic structure, reducing the significance of 'hot- 
spots' in the background which may be falsely identified 
as true clusters. 



C. Background Models 

In order to characterize the morphology of the ex- 
pected diffuse background we follow the detailed PASS 
7 Galactic Diffuse Model, which contains both spec- 
tral and morphological information generated by obser- 
vations of both HI and CO line surveys, which con- 
strain the distribution of interstellar gas. The 7-ray mor- 
phology and spectrum are then generated by convolv- 
ing these maps with the modeled cosmic-ray densities 
utilizing the Galprop code |7T], and calculating the ex- 
pected 7-ray emission from processes including 7r°-decay, 
bremsstrahlung emission, and inverse-Compton scatter- 
ing. Utilizing these simulations, we then generate a 
Monte Carlo population of background 7-rays following 
a morphology compatible with observations across the 
7-ray spectrum. 

In the case of the Fermi-LAT telescope, we assume zero 
cosmic-ray contamination, since we are considering only 
the GC region, which is very bright in 7-rays. Given 
the calculated intensity of the 7-ray line, we calculate 
an average of 12 signal, and 36 background photons be- 
tween an energy of 120-140 GeV. In each simulation of 
the Fermi-LAT data, we allow the strength of this signal 
to float using Poisson statistics, setting the mean inten- 
sity to be 12 signal photons. For Fermi-LAT observations 
we model a 10° square window around the GC. 

In the case of ACT observations, we note that cosmic- 
ray contamination is a much larger issue, since hadronic 
showers dominate the data collected by these telescopes. 
Extrapolating the cosmic-ray and 7-ray signals from re- 
cent low-energy H.E.S.S. observations at 300 GeV [7^ 
and using the best estimates for the H.E.S.S. -II instru- 
mental characteristics [73], we find that 86% of the to- 
tal background signal will stem from cosmic-ray back- 
grounds. Thus, for ACT observations we create a simu- 
lation composed of 4.35% signal photons, 13.15% diffuse 
background photons (following the Fermi PASS-7 Galac- 
tic diffuse model) and 82.5% isotropic background pho- 
tons. As in the case of Fermi-LAT observations, we allow 
the total number of signal photons to float using Pois- 
son statistics, and maintain a background which is 86% 
isotropic, and 14% diffuse. For H.E.S.S. -II observations 
we model a 4° square window around the GC, consistent 
with the smaller field of view of ACT instruments. 



D. Clustering Algorithm 

In order to classify the spatial morphology of pho- 
tons in a statistically robust way, we employ the Density 
Based Spatial Clustering of Applications with Noise (DB- 
SCAN) algorithm [73], which is capable of both distin- 
guishing cluster points from noise and constraining the 
maximum connectivity size based on the instrumental 
point-spread function. DBSCAN possesses two input pa- 
rameters, corresponding to the assumed radius (e) of each 
cluster neighborhood and the number of points (Nmjn) 
which must be contained within a neighborhood to form 
a new cluster or add to an existing cluster. Our imple- 
mentation of DBSCAN, modified from the Scikit-Learn 
python package [75], works as follows: 

1. For each point in the input list, define the 'e- 
neighborhood' as a circle of radius e centered on 
the point of interest. 

2. If a point's e-neighborhood contains greater than 
A'^min points a new cluster is formed and that point 
is marked as a 'core point'. 

3. Two core points are considered 'density-connected' 
if the points are mutually contained with in each 
other's e-neighborhoods. All density connected 
points are then merged into a single cluster. 

4. Require any cluster to contain at least 3 core points. 

Traditional DBSCAN implementations also define the 
notion of 'density reachable' to indicate points which are 
not themselves density-connected, but which lie within 
a core point's e-neighborhood. This property is, how- 
ever, not symmetric, and cluster assignment in general 
depends on the input ordering of the data. Our algorithm 
ignores density-reachable points, thus ensuring determin- 
istic results. While we assume, in this analysis, that each 
profile is centered at the position of the GC, our results 
are independent of this assumption as the DBSCAN al- 
gorithm focuses on the relative position between photons, 
oblivious to any zero point of the profile. We note that 
a slightly modified DBSCAN algorithm has already been 
employed on Fermi-LAT data in the past [75]. 



FIG. 1: Event map of Fermi photons between 120-140 GeV (left) and a sample 3 pulsar Monte Carlo simulation (right) showing 
in colored circles the DBSCAN e— neighborhoods for core points in each detected cluster. 



To exemplify the use of the DBSCAN algorithm, Fig- 
ure [T] shows the DBSCAN analysis of the Fermi-LAT 
photon events measured with an energy between 120 ~ 
140 GeV (left), and of a simulated model containing two 
point sources near the galactic center (right). In each 
case, we show the DBSCAN e-neighborhoods for each 
core point of each detected cluster. For the Fermi results, 
DBSCAN finds only one cluster (interestingly centered 
on the actual Galactic center location!), while in the 3 
pulsar simulation case, the algorithm correctly identifies 
three clusters, at the positions corresponding to where 
the pulsar photons were generated. The following dis- 
cussion explains in detail the procedure we employ to 
apply DBSCAN to 7-ray data. 

To quantitatively compare our models against Fermi 
data, we follow Tramacere and Vecchio [75] and employ 
the likelihood ratio proposed by Li and Ma fTTl to calcu- 
late the cluster significance, s, in terms of the number of 
cluster photons Ng and background photons Ni,: 
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Ns + Nb 



Nbln 



2Nh 



Ns + Nb 



(4) 



Here, Nb represents the expected background counts, de- 
termined by integrating a diffuse background model (dis- 
cussed in Subsection II C I , while Ng is based on the to- 



tal photon count contained in the cluster; we effectively 
adopt a = 1 in the notation of Ref. [77]. According to 
Ref. [77], as long as Ng and Nb are not too sparse, one 
can equate a cluster with significance s to an "s-standard 
deviation observation" . Thus a cluster significance s = 2 
implies the cluster is a 2a fluctuation above the mean 
background as computed in Subsection |IIC[ We will use 
this nomenclature in our analysis. With the individual 
cluster significance in hand, we now define the "global" 
significance S as the mean significance of each detected 
cluster weighted by the number of photons in that cluster. 
We then optimize the choices of the DBSCAN parame- 



ters e and N^i^ for ACT simulations by maximizing the 
global significance for the clustering results from our pul- 
sar simulations. Finally we explain why this optimization 
procedure does not work with the limited Fermi photon 
count at 130 GeV and choose appropriate DBSCAN pa- 
rameters based on the Fermi spatial point spread func- 
tion. 

The value of e must be large enough that true cluster 
elements are not excluded, but small enough that noise 
is not included. The variable e, as a result, is closely 
tied to the physical size of the instrumental PSF. One 
must additionally choose a value Nmin large enough such 
that the background does not easily fluctuate above this 
number, but low enough that one has a high efficiency of 
finding real clusters. To this end, we use simulations of 
1, 2, and 3 pulsar models for Fermi simulations, and 2, 4, 
and 6 pulsar models for ACT observations to determine 
a region of (e, A^min) parameter space which simultane- 
ously optimizes the significance and detection efficiency. 
Displayed in the top row of Figure [2] is the global cluster- 
ing significance (shown in filled contours) and number of 
detected clusters with s > 1.29 (inset, labeled contours) 
for 48 photon Fermi simulations of 1, 2, and 3 pulsars as 
a function of the DBSCAN parameters e and Nj^^i^. In 
the bottom row, we again plot the global clustering sig- 
nificance and number of detected clusters with s > 2 for 
5000 photon ACT simulations with columns from left to 
right corresponding to 2, 4, and 6 pulsar models. We note 
that we apply a firm cut that A'min must be at least 3, as 
this represents the lowest possible non-trivial clustering 
which may be analyzed by the DBSCAN algorithm. 

Inspection of the all three columns for both Fermi and 
ACT simulations reveals that the clustering algorithm 
detects clusters at high significance over large coincident 
regions of DBSCAN parameter space. In the case of ACT 
observations where the clusters are better differentiated 
from the background, we also see that these regions also 
detect the correct number of clusters until the number of 




FIG. 2: Global cluster significance S (filled colored contours) and total number of clusters A'ciustcrs for Fermi (ACT) simulations 
(above threshold s > 1.29 (s > 2.0); labeled contours) as a function of the DBSCAN search radius e and core-point threshold 
A^min- The top row corresponds to Fermi simulations of 1 (left), 2 (center), and 3 (right) pulsar models while the bottom row is 
for ACT observations of 2 (left), 4 (center), and 6 (right) pulsar models. Results are relatively insensitive to large coincident 
regions in the scan parameter space for left and center columns, while the dependence on Nmin increases as the number of 
photons per pulsar approaches the background rate. Fermi simulation DBSCAN parameters are chosen to be e = 0.35, A'min = 3 
while ACT simulation DBSCAN parameters are chosen to be e = 0.05, A^'min = 8 as a compromise between the cluster detection 
efficiency and the significance over background. 



pulsars becomes to large to reliably detect all true clus- 
ters. This indicates that the results are robust for most 
reasonable choices of scan parameters while in the case 
of 6 pulsars, the number of detected clusters is somewhat 
more sensitive to parameter choices. For these ACT sim- 
ulations, we see that choosing iV,„in too small, or e too 
large can lead to the identification of extra (false) clusters 
which lowers the overall significance. 

We choose our scan parameters based on the 6 pulsar 
simulations (bottom right) which have the lowest signal 
to noise ratio among the models we consider here. These 
considerations motivate a choice of e = 0.35°, A^min = 3 
for Fermi simulations and e — 0.05°, A^min = 8 for ACT 
projections as a balance between preserving significance 
and detecting most of the clusters at s > 2. We note 
that detailed studies on the behavior of DBSCAN set- 
tings applied to Fermi-LAT data at lower energies have 
found qualitatively comparable optimization regions for 
DBSCAN parameters [75]. 

In summary, for ACT observations we expect ^5000 
photons for a 6h exposure, and use our significance mea- 
sure balanced against the number of detected clusters to 



optimize the DBSCAN parameters. We find e = 0.05° 
and Nmin — 8. For Fermi observations, we choose 
e = 0.35° and N„ii„ = 3, which represents the low- 
est level of non-trivial clustering. We note that there 
are only 48 photons in our sample, and thus we do not 
expect to be able to identify more than a few clusters cor- 
responding to point sources with our analysis technique. 



III. RESULTS 

In order to compare our models of the expected 
130 GeV line signal produced by both dark matter and 
pulsars, we first calculate the clustering properties of the 
actual Fermi dataset in the energy range of 120-140 GeV 
using the DBSCAN algorithm. We find only one detected 
cluster with a significance of s — 1.29, an angular scale of 
0.22° (defined as the mean pairwise distance of each pair 
of cluster members), and 3 member photons (see Fig. n] 
left). 

We first define the parameters useful for differentiat- 
ing different emission classes and then compare to current 
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FIG. 3: Models for the clustering properties expected from both Fermi-LAT (48 photons total, left) and H.E.S.S.-II (5000 
photons total, right) observations of annihilating dark matter following a NFW profile (blue dashed), flat density profile (green 
dashed), Einasto profile (red dashed) and decaying dark matter following an NFW profile (cyan dashed), as well as models of 
emission from undetected groups of one (magenta solid), two (yellow), three (black), four (blue), five (green), and six (red) 
pulsars, compared to the clustering properties observed in the Fermi-LAT data binned from 120-140 GeV (magenta dot dash). 
The top row shows the distribution of global significance of detected clusters (S, top row). All other quantities are calculated 
in the subspace of clusters with significance s > 1.29 (s > 2) for Fermi (ACT) simulations. Shown is the distribution of mean 
clustering radii (2nd row), the distribution of the total number of clusters detected (3rd row), and the distribution of the 
average number of member photons in each cluster with s > 2 (bottom). 



Fermi data and projections for upcoming H.E.S.S.-II ob- 
servations. In addition to the global significance, S, we 
define three quantities in the space of clusters with signifi- 
cance s > 1.29 for Fermi and s > 2 for ACT observations. 
The quantities we employ are chosen to best capture the 
results of each simulation based on the output of DB- 
SCAN, and provide useful information on the clustering 
properties of each source model. Specifically, we use: 

1. the mean clustering radius, rdustcr, defined as the 
average of the mean pairwise distance (angular 
scale) of each cluster above threshold, weighted by 
the cluster's significance; 

2. A'ciustors defined as the total number of clusters de- 
tected above threshold with at least 3 (10) cluster 
members for Fermi (ACT), and 

3. iVijioijibcrs defined as the mean number of photons of 
clusters above threshold, weighted by each cluster's 
significance. The significance weighting is used sim- 
ply to suppress the influence of clusters which are 
likely background fluctuations. 

In Figure [3] we show the results of the DBSCAN algo- 
rithm applied to both the Fermi data (vertical dashed- 
dotted line), compared to results from Monte Carlo sim- 
ulations of dark matter and point source emission from 
pulsars for 48 photons (left; resembling Fermi-LAT ob- 
servations) and 5000 photons (right; resembling ACT 
observations). The dashed lines correspond to diffuse 
dark matter annihilation models - NFW (blue dashed), 
Einasto (red dashed), NFW decay model (cyan dashed) 
and a flat distribution (green dashed) . Pulsar models are 
represented by solid lines - 1 (magenta), 2 (yellow), 3 
(black), 4 (blue), 5 (green), and 6 (red). Each histogram 
is normalized and if no clusters are found, the significance 
defaults to zero. We thus see that it is much less likely 
for diffuse models to produce clusters dense enough to be 
picked up by DBSCAN, indicating that our combination 
of e and A^min arc reasonably efficient at rejecting spuri- 
ous background clusters, especially in the ACT case. 

The global cluster significance S (top row of Figure pi) 
provides the strongest metric for differentiating point- 
source from diffuse emission. Diffuse models should pos- 
sess virtually identical clustering properties as the ex- 
tent of any true structure is much larger than the in- 
strumental point spread function. Thus, diffuse sources 
should only occasionally produce low significance, loosely 
grouped clusters due to background ffuctuations. One 
possible exception for dark matter models is the identifi- 
cation of a single point source at the GC where the dark 
matter annihilation rate can be very large. It is notable 
that the single cluster found in the Fermi data lies pre- 
cisely on the galactic center. We see in the case of Fermi 
simulations (top left panel) that there is a fairly sharp 
cutoff in the fraction of models with global significance 
S ^ 1. This lower bound is set by DBSCAN's minimum 
cluster detection requirements, as well as the location 
of the cluster with respect to the background template. 



Model 



/-Fermi-LAT 
/3>1.29 



/■Fermi-LAT 
/s>1.29 



/•ACT 
7s>2.0 



/■ACT 
7s>2.0 



/■ACT 
Js>2.0 



1 n 



NFW 


0.362 


0.059 


0.060 


0.002 


0.0 


Einasto 


0.335 


0.052 


0.039 


0.001 


0.0 


NFW Decay 


0.264 


0.031 


0.034 


0.001 


0.0 


Flat 


0.257 


0.030 


0.031 


0.001 


0.0 


1 Pulsar 


0.992 


0.197 


0.984 


0.003 


0.0 


2 Pulsars 


0.971 


0.681 


0.997 


0.902 


0.030 


3 Pulsars 


0.934 


0.631 


0.998 


0.967 


0.726 


4 Pulsars 


0.897 


0.556 


0.997 


0.956 


0.769 


5 Pulsars 


0.851 


0.485 


0.993 


0.942 


0.730 


6 Pulsars 


0.811 


0.424 


0.985 


0.888 


0.635 



TABLE I: The fraction of simulations with at least 1 cluster 
of significance s > 1.29 (n = 1 and s — 1.29, corresponding 
to the maximum n and Si for clusters i found in Fermi-LAT 
data) for Fermi-LAT simulations (column 2), the of Fermi- 
LAT simulations with at least 2 clusters of significance s > 
1.29 (column 3) and fraction of simulations with at least n 
clusters detected at significance s > 2.0 with at least 10 core 
points for ACT simulations (columns 4-6). 



which determines the number of background photons in 
that region. Because the cluster detected in the Fermi 
data is loose, we expect the single detected Fermi clus- 
ter to be close to the effective cutoff {S = 1.29 for the 
detected cluster). 

The second row of Figure [3] shows the mean clustering 
radius. For ACT observations there is a clear division 
between the point source and smaller diffuse emission 
scales. However, the Fermi-LAT simulations do not of- 
fer any useful discrimination between models with such 
low photon counts. For point sources, this distribution is 
governed by the average value of the PSF and possesses 
an asymmetric tail at larger scales due to the inclusion 
of background photons and events whose true position is 
determined by the long PSF tails. For diffuse models, the 
distribution is governed dominantly by the e DBSCAN 
parameter until the background density becomes domi- 
nant. 

Displayed in the third row is the distribution of the to- 
tal number of clusters, -/Vciustcrs, found with significance 
s > 1.29 (s > 2). In the case of ACT observations we 
also require clusters to have at least 10 core points to 
be included whereas we required only 3 for Fermi. We 
expect to be able to discern at most Nsig/N^i^ point 
sources if the signal photons happen to distribute them- 
selves evenly between sources. Even in this case, these 
true clusters may still lie below the significance thresh- 
old. It is realistic to identify only 2-3 true clusters with 12 
signal photons (48 total) if they in fact have point source 
progenitors. This is reflected here. For diffuse sources, 
only a fraction the detected clusters pass the significance 
cut. As the number of events is increased, this signifi- 
cance cut could be increased to maintain a high accep- 
tance/rejection ratio, though it is clear that for diffuse 
models, we typically obtain either zero clusters or one 
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cluster per simulation with our current significance cuts, 
while still efficiently detecting several clusters in the case 
of pulsar models. 

Finally, the fourth row contains the distribution of the 
mean number of cluster members, A'mcmbors- For a ran- 
dom distribution of events between pulsars, we expect 
a Poissonian distribution with a mean of approximately 
Nsig/Npuisars (contributions from the background should 
typically be <1 photon for Fermi, but the number of 
signal photons can fluctuate significantly during Pois- 
son sampling). For Fermi- LAT observations we expect 
12 ± 3.5 signal photons distributed between Np^isars- In 
the case of ACT observations, we expect 232 ± 15 pho- 
tons. Occasional spurious clusters will force this distri- 
bution downwards, although this effect is reduced by the 
significance weighting and because we only consider "core 
points" to be cluster members, thus rejecting those lying 
on the cluster boundaries. 

In order to quantify in how confidently point-source or 
diffuse models for the 130 GeV excess can be rejected, we 
count the fraction of simulations which are incompatible 
with Fermi-LAT data for each tentative source class. A 
simulation is deemed incompatible if at least 1 cluster 
is detected at significance s > 1.29 corresponding to the 
maximum number and significance of clusters detected in 
the Fermi-LAT data. The first column of Table U shows 
the fraction of simulations for each model with at least 
one cluster [n — 1) with Si > 1.29. In column 3 we 
show the fraction of simulations which have two clusters 
detected with a significance of Si > 1.29 in Fermi-LAT 
simulations, which further demonstrates the vast statis- 
tical separations between the clustering properties of dif- 
fuse and point-source models. In the subsequent columns 
we show similar data for the H.E.S.S.-II telescope, which 
also requires a cluster to have at least 10 core points, 
and clearly provides an ever greater ability to differenti- 
ate between source classes producing the 7-ray line. 

Our statistical approach shows that Fermi-LAT data 
already rule out models where the 130 GeV 7-ray line is 
produced by 1, (2, 4) or fewer pulsars at the 99% (95%, 
90%) confidence level (CL). Specifically, only one cluster 
was detected in the Fermi-LAT data with a significance 
s — 1.29, while a cluster with a larger significance is 
observed in more than 90% of simulations with any en- 
semble of less than 4 point sources. Due to the greatly 
increased effective area of the H.E.S.S.-II telescope, we 
find an even greater statistical separation between our 
models of diffuse and point source emission. This indi- 
cates that H.E.S.S.-II will be able to conclusively differ- 
entiate models of the 130 GeV line using purely statistical 
properties. 



IV. DISCUSSION AND CONCLUSIONS 

If confirmed, the tentative detection of a 7-ray line in 
the Fermi data might potentially turn into one of the 



most important breakthroughs on physics beyond the 
Standard Model, pointing towards the mass of the dark 
matter particle. Key future developments include the 
analysis of Fermi 7-ray events with the forthcoming Pass 
8 version of the Fermi-LAT analysis software, which will 
include a major overhaul of the energy reconstruction al- 
gorithm [7S]. It will also be crucial to identify whether 
the excess events at 130 GeV from the Earth's limb are 
indeed a statistical fluke. This question will be answered 
by increased exposure, which will increase the current 
statistical sample [THl [2H] ■ 

At present, barring instrumental effects, a 130 GeV 
line could be ascribed to either new physics, presumably 
a dark matter particle decaying or pair-annihilating into 
a 27 (or 7Z, 7h etc.) final state, or to one or more pulsars 
featuring a cold wind with electrons with an energy at, or 
very close to, 130 GeV. This latter possibility, it is argued 
in Aharonian et al. [50] j is the only "traditional" astro- 
physical process envisioned thus far that could produce 
a sharp gamma-ray line in the energy regime of interest. 
It is therefore of the utmost importance to discriminate 
between a cold pulsar wind scenario and a dark matter 
scenario, if indeed the line is resilient to future tests and 
observations. 

Discriminating dark matter and pulsar interpretations 
of the 130 GeV line may not be possible based solely on 
the spectral characteristics of the Fermi-LAT data. In 
the present study we sought to use morphological in- 
formation, i.e. the 130 GeV events' arrival direction, 
to establish whether the signal is likely due to multiple 
point sources as opposed to a truly diffuse origin. This 
is a meaningful question, since the signal region is much 
larger than the instrumental angular resolution, and it 
should be thus possible to discriminate a finite number 
of point sources, as expected in the pulsar case, from a 
distribution that follows a diffuse morphology, such as 
what expected from dark matter annihilation or decay. 

To quantitatively approach the issue of discriminat- 
ing pulsars versus dark matter on a morphological basis, 
we employed the DBSCAN algorithm which distinguishes 
clusters from background noise based on the local pho- 
ton density. We defined a statistical significance mea- 
sure, and we optimized the algorithm's two physically 
well-constrained parameters in order to reconstruct as 
accurately as possible the potential "clusters" producing 
the observed 7-ray events. 

As a result of our analysis of the available Fermi-LAT 
data, we concluded that at the 99%, 95%, 90% confidence 
level the data need at least 2, 3, 5 or more point sources, 
respectively, while the events' morphology is perfectly 
consistent with various dark matter density profiles. We 
conclude that the data strongly disfavor the hypothesis 
of a small number of pulsars as the origin of the signal. If 
the pulsar scenario is indeed the culprit for the 130 GeV 
events, it is necessary to postulate a relatively large popu- 
lation of pulsars (likely at least 4) with cold winds featur- 
ing electrons with exactly, to within the instrumental en- 
ergy resolution, the same energy. This appears, to say the 



11 



least, quite problematic. A diffuse origin seems therefore 
the most likely scenario for the 130 GeV photons. Our 
clustering algorithm approach, clearly, is not optimized 
to discriminate between different diffuse morphologies: in 
fact, on the basis of our results, we find that we cannot 
discriminate between different diffuse morphologies (like 
dark matter annihilation vs. decay). 

Present and future observatories have the potential to 
shed additional light on the presence and characteristics 
of the 130 GeV line [79]. Improvements to the H.E.S.S. 
telescope (H.E.S.S. -II) have reduced the 7-ray threshold 
to around 50 GeV, allowing for the independent deter- 
mination of a line signal from the GC region. This is 
especially important, as the 10'' m^ collecting area of the 
H.E.S.S. telescope will quickly alleviate the low-statistics 
issues involved in Fermi-LAT studies |60) . Furthermore, 
future instruments such as Gamma-400 [80] and CTA [81] 
are likely to provide the necessary effective area and 
energy-resolution to definitively and conclusively test the 
existence and nature the 130 GeV line feature. 



However, in the near future, the most important contri- 
bution is expected to come from Fermi-LAT itself: Ad- 
ditional data taken since last year, and the continuous 
accumulation of more data over the next years, will show 
whether the signature persists or is a rare statistical fluke. 
Simultaneously, the availability of pass 8 events, based on 
a set of completely rewritten event reconstruction algo- 
rithms for the LAT, will allow a fresh look on possible 
instrumental systematics. 
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